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RECURSIVE  NONLINEAR  FILTERING  THROUGH 


COMPLETE  LINEARIZATION 

Timothy  F.  Dyson 
444  Jacksonville  Road 
Warminster,  PA  18974 

ABSTRACT 

Recursive  filtering  forms  the  foundation  of  all  tracking  and  position 
estimation  tasks  when  the  target  undergoes  dynamic  motion  relative  to  the 
observer.  Moreover,  recursive  filtering  has  an  important  role  in  stochastic 
signal  detection.  The  typical  application  almost  invariably  involves  dynamics 
equations  which  depend  nonlinearly  on  the  physical  variables  and  sometimes 
even  the  sensor  (observation)  equation  involves  a  nonlinear  relation  to  the 
quantities  to  be  estimated.  In  these  common  situations  the  practitioner 
nearly  always  resorts  to  a  form  of  approximate  linearization  of  the  describing 
equations  to  allow  the  use  of  the  now  classical  Kalman-Bucy  filter  equations. 
Although  highly  successful  in  many  useful  applications,  it  is  commonly  recog¬ 
nized  that  a  variety  of  problems  simply  will  not  satisfactorily  succumb  to 
the  usual  "Extended  Kalman"  methods.  For  these  situations,  current  research 
in  nonlinear  filtering  theory  hold  major  portent.  Indeed,  the  new  understand¬ 
ing  of  nonLinear  problems  developed  over  the  past  10  years  leads  to  optimism 
for  better,  more  refined  approximate  filtering  techniques  which  may  prove  to 
be  both  accurate  and  more  widely  applicable. 

Alternatively,  dynamics  equations,  which  naturally  formulated  are  non¬ 
linear,  may  sometimes  be  maneuvered  into  a  form  which  is  exactly  linear. 
This  phenomenon  has  been  well-known  in  deterministic  problems  for  a  long 
time.  That  the  same  situations  might  arise  in  a  stochastic  setting  seemed 


to  be  obvious  but  was  never  characterized  in  detail  except  for  the  simplest 
of  situations.  In  this  report  such  system  state  variable  equations  are 
completely  characterized  and  a  method  for  converting  them  to  an  equivalent 
linear  realization  given.  In  addition,  very  simple  tests  are  developed 
to  determine  when  a  nonlinear  stochastic  differential  equation  of  the 
diffusion  type  can  be  completely  linearized  by  these  methods. 

Along  the  way  it  is  pointed  out  that,  under  the  right  conditions,  although 
highly  improbable  in  real-world  situations,  certain  nonlinear  filtering 
problems  become  completely  linear  ones  by  these  tests  and  methods  and  hence 
yield  to  the  well-known,  exact  Kalman-Bucy  filtering.  Beyond  these  trivial 
and  rather  contrived  situations,  the  existence  of  linearlzable  equations  of 
"motion'*  suggest  potentially  more  accurate  approximate  filters  based  on  the 
transformed  state  variables.  One  such  situation  is  investigated  here  and  it 
is  shown  empirically  that  lower  estimation  error  is  obtained  from  this  tech¬ 
nique  over  an  alternative  extended  Kalman  filter  for  the  same  real-world 
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CHAPTER  1 
INTRODUCTION 

This  report  addresses  the  problem  of  sample  path  estimation  of  partially 
observed  stochastic  processes.  Also  referred  to  as  filtering,  this  problem 
arises  in  diverse  applications:  communication,  economics,  radar  and  sonar. 
Specifically,  this  report  considers  an  approximate  filtering  approach  for 
problems  in  which  the  state  variables  are  governed  by  certain  nonlinear 
stochastic  differential  equations  which  are  closely  related  to  a  linear 
stochastic  differential  equation.  The  suggested  approximation  scheme  is 
based  on  developing  extended  Kalman  filters  from  the  related  linear  repre¬ 
sentation  to  avoid  the  sometimes  highly  nonlinear  functionals  occurring  in  the 
nonlinear  state  equations. 

Although  completely  linearizable  deterministic  nonlinear  differential 
equations  are  well-known  [7]  and  completely  characterized  [8,9],  and  equiva¬ 
lent  linearization  ideas  for  stochastic  nonlinear  differential  equations 
surmised  for  some  time  [10,11]  it  has  remained  to  specifically  describe  and 
characterize  in  detail  the  conditions  under  which  this  transformation  from 
nonlinear  representation  to  linear  representation  would  occur  in  the  stoch¬ 
astic  setting.  Chapter  2  accomplishes  this  detailed  characterization  for  all 
time  invariant  nonlinear  stochastic  differential  equations.  Chapter  3  studies 
an  approximate  filtering  application  using  the  equivalent  linear  represen¬ 
tation  ideas  in  an  amplitude  and  phase  tracking  problem. 

The  Filtering  Problem 

The  general  filtering  problem,  a  specialization  of  which  is  dealt  with 
in  this  report,  is  expressed  in  the  following  way: 


Let  (fi,  F,  P)  be  a  fixed  underlying  probability  space  on  which  there  is 
an  increasing  family  of  o-fields,  (F  ,  0£t£T}.The  signal  or  state  process, 
{Xj.,  t>0),  is  F  measurable  for  each  t  e[0,  T]  and  is  assumed  to  be  a 
homogeneous  Markov  process  and  a  diffusion  process  satisfying  a  stochastic 


differential  equation  of  the  form, 


dxt  =  f(xt)dt  +  g(xt>dwt  (1.1) 

where,  xt  e  Rn;  £  Fm  is  a  vector  valued  standard  Brownian  motion  process 
defined  on  (fi,  F,  P) .  The  process  {x^,  t^0}has  generator  L,  where 

£<t>00  =  (I  fi(x)  3^7 ^  +  2  1  (  1  gik(x)gjk(x^3x.ax.  * 

1  l  l,  j  =  l  k=l  i  J 

for  all  ij>  in  the  domain  of  L.  The  functions  f ,  g  are  assumed  to  be  measurable 
mappings  and  to  satisfy  continuity  and  growth  conditions  (e.g.,  [1,2])  suffi¬ 
cient  to  insure  that  {x^.}  exists. 

In  the  filtering  problem  (x^}  is  not  directly  observed,  but  is  observed 

through  a  related  process  {y  t > 0}  satisfying 

dyt  «  h(xt)  dt  +  dvt  (1.2) 

where  {v^,  t^0}is  a  standard  Brownian  motion,  Ft~measurable  and  h(*)is 

n  k 

some  mapping  from  F  to  range  of  {yt},  say  K  .  The  problem  is  then  to 
compute  conditional  statistics  of  {x^.}  given  only  the  o-field  generated  by 

{ys.  0<  s  <  t}. 

Present  Status  of  Filtering  Theory 


Considerable  progress  has  been  made  in  this  problem  by  dealing  with  the 
so-called  Duncan-Mortensen-Zakai  equation  [2,11,12]  for  the  evolution  of  the 
unnormalized  conditional  density,  p(x,  t),  given  by 


) (x , t )  =  p(x  ,  0)  +  /  L*p(x,s)ds  +  /  p(x,s)h(x)dy 


(1.3) 


-2- 


where  L *  is  the  formal  adjoint  of  L  and  the  last  integral  is  to  be  interpreted 
as  a  stochastic  integral  [1,2].  The  progress  in  understanding  the  nature  of 
nonlinear  filtering  is  due  primarily  to  the  bilinear  form  of  this  equation 
and  the  connections  made  between  (1.3),  the  theory  of  nonlinear  partial 
differential  equations  and  nonlinear  systems  theory  [4,5,61. 

In  particular,  connections  between  the  Lie  Algebraic  properties  of  the 
algebra  of  operators  induced  by  (1.3),  or  more  precisely  the  Stratonovich  form 
for  (1.3), 

L  -  {L*  -  y  h2 ,  h)LA 

and  the  finiteness  conjecture  of  Brocket  [4]  have  proven  to  be  the  most  fruit¬ 
ful  avenues.  Although  the  fundamental  Homomorphism  Conjecture  [4,6,25] 
required  to  close  the  loop  on  nonlinear  filtering  theory  has  been  elusive  in 
the  general  case,  (see  however,  [19,21,25  &  26)  many  striking  results  have 
evolved  from  this  line  of  reasoning,  for  example  [13-20], 

The  status  of  approximate  filtering  for  this  problem  framework  is  less 
satisfactory.  In  a  limited  class  of  problems  the  aforementioned  Lie  Algebraic 
studies  have  produced  reasonable  results  [13,14,21].  More  generally  appli¬ 
cable  approaches  include  the  time  worn  extended  Kalman  filter  and  second 
order  approximations  [24]  and  a  few  less  well  known  series  expansion  tech¬ 
niques  [21-23].  However,  it  has  been  strongly  suggested  in  the  literature 
that  Lie  Algebraic  techniques  may  be  a  profitable  vein  of  inquiry  for  approx¬ 
imate  filtering  in  a  more  general  context  than  has  so  far  been  attempted 
[6 , 12a, 13 ,25,  especially  27,  28]. 


-3- 


The  remainder  of  this  report  deals  with  a  rather  different  tact  for 
approximate  filtering  applications.  The  approach  taken  is  to  consider 
problems  in  which  the  state  process,  in  addition  to  the  conditions  above, 
admits  pointwise  transformation  to  a  potentially  more  useful  representation. 
Chapter  2  develops  the  details  of  the  transformation  theory  for  a  linear 
representation.  Although  this  material  is  built  straight  forwardly  upon 
work  in  deterministic  linearization  [7,9],  very  concise,  as  well  as  con¬ 
structive,  procedures  are  developed  to 

a.  test  any  given  nonlinear  stochastic  differential  state  equation  for 
the  extistance  of  an  alternate  "controllable”  linear  representation, 
and 

b.  exactly  determine  the  smooth  transformation  of  state  variables 
providing  this  alternate  representation. 

Chapter  3  then  indicates,  by  way  of  an  interesting  example  of  practical 
significance,  how  this  transformation  theory  can  be  applied  to  formulate 
approximate  state  estimation  filters.  The  example  studied  is  subjected  to 
Monte  Carlo  simulation  to  demonstrate  the  potential  performance  advantage  of 
the  suggested  a,..  ion  of  the  transformation  approach  over  other  extended 


Kalman  filters 


In  an  analogous  fashion  we  have  the  vector  spaces 

n  .  .  .  ... 

A2(Bn)=  (to(x)  | (jJ (x)  =  £  a)1'1  (x)dx1  A  dx'' ,  u)1'' (c)C°°} , 

i»j=l 

A3(Rn),  An(Bn)  of  p-forms  (p=2 ,3 , . . . ,n).  We  also  Identify 

A°(Rn)  with  F(Bn)  •  The  collection  of  all  p-forms  over  Rn  (o  _<p  <^n)  together 
with  the  exterior  product  defined  as  above,  and  extended  in  the  obvious  way 
to  any  oj  c  A(En)  ,  defines  the  Exterior  Algebra,  (A(Rn)  ,  A,  F(Rn)), 
or  simply  A(Rn) . 

If  in  addition  we  have  an  inner  product  on  Rn  then  using  (2.14)  let 
x;AP*(Iin)  and  y  e  A^(Rn)  we  have  a  pairing  between  A(Rn)  and  its  dual: 


<x|y>  - 


0  ,  P  4  Y 

det(xi>  y~* ) ,  P  =  Y 


(2.15) 


where  AP(R°)  =  space  of  exterior  p-forms  over  Rn. 


We  summarize  the  major  properties  of  p-forms: 

(a)  if  dim  T(R  )  =  n  then  An(Rn)  is  1-dimensional 

(b)  AP(Rn)  has  dimension(n) 

P 

(c)  a  basis  for  AP(Rn)  is  dxlxA ♦ • • Adx1?;  i, ,  •••,  i  =  1,  •••,  n 

1  P 

(d)  dxm  A  dxm  =  0;  if  uieA(]Rn),  w  A  (0  =  o 

(e)  (Exterior  Differentiation)  d^:  AP(]Rn)->-  AP+*(Rn);  p  <  dim  7'(Bn)-l 

n 

If  ii)=^  a^j,  (x)dx1*  A  •••  AdxXP 


n  n 


then  dw  =  \  \ 

1  =  1  1 

(f)  A°(Fn)  =  F{  Rn) 


dx^  A  dx^^  A  • • •  Adx^P 


(«)  by  (a),  if  ait:  An()kn)  and  dim  Y'(Rn)  -  n,  da;  =  0 
(h)  r  f  d ,  =0  then  ...  is  Closed, 
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The  natural  pairing  between  TCRn)  and  A1(Rn)  is  expressed  by 


<T-n>*  ■  1 1  t1(x)  Vx>  (dxJ’  317). 


=  l  t 1 (x)  n .  (x) 

1  1 


(2.14) 


where  the  dual  basis  elements  satisfy  the  relation 

(dx:i>  air)  ‘  6ij 

IX 

Hence  :  T(Rn)  x  A1(Fn)  -*■  F(Rn)  is  a  bilinear  mapping  of  the  tangent 

space  and  its  dual,  and  thus  defines  the  action  of  n  £  A1(Rn)  as  a  linear 


functional  on  T(R  )  or  vice  versus. 


From  the  vector  space  structure  on  Ax(Rn)  it  is  possible  to  construct 
an  algebra  of  differential  forms  known  as  the  Exterior  Algebra,  A(]Rn) .  The 
product  operation  used  to  construct  A(Rn)  is  called  the  "wedge"  or  exterior 
product.  It  Is  a  skew  symmetric  multiplication  of  differential  forms  defined, 
in  coordinates,  as  follows: 

n  n 

For  V,n  £  A1(Rn)  ,  v(x)  =  £  V.(x)dx1  and  n(x)  =  £  n.(x)dx1 

1  1  1  1 


II  II 

n(x)  =  l  l  v  (x)n. (x)dx 
i=l  i =1  J 


1  A  dxJ 


where 


dx  *  r'  dx^  = 


1  =  J 


-dx^  dx1  i  ^  j 
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I 


3.  SINGLE  INPUT  N-DIMENSIONAL  STOCHASTIC  DIFFERENTIAL  EQUATIONS 


I 


B 

I 


Geometric  Preliminaries 

This  section  addresses  the  n-dimenslonal  problem.  Because  of  the  added 
complexity  of  this  problem,  we  first  review  several  well  known  results  from 
Differential  Geometry  and  the  Exterior  Calculus  which  will  be  useful  in  what 
follows.  Everything  we  will  do  Is  in  Rn  ,  so  the  discussion  is  restricted  to 
these  spaces.  Much  of  the  general  material  presented  comes  from  [10]  and 


111]. 


Smooth  (C00)  mappings  from  E  -*•  En  induce  vector  fields  over  En  .  The 
natural  (Euclidean)  coordinate  system  on  En,  (x^,  ...  xn) »  Induces  an  atlas  on 
Rn  .  With  this  atlas,  E°  is  viewed  as  a  differentiable  manifold  and  the 
induced  vector  fields  over  En  can  be  identified  with  the  vector  space  of 
linear  functionals  on  the  space  of  functions  on  this  manifold,  F(En):  B*1-*-  E  . 
Under  this  identification,  a  natural  basis  (in  a  coordinate  neighborhood)  on 
the  space  of  vector  fields  over  E°  (i.e.  the  Tangent  Space  to  En  -  27(En))  is 


3 

3x 

n 


}  . 


Thus  a  "tangent"  vector  field  in 


!T(E  )  is  given  (locally)  by 


x(x)  =  l  xi(x)  ,  (2.13) 

I  i 

where  x  e  En  and  the  tL(*)  are  C°°  functions  on  En  .  The  dual  space  to  the 
tangent  space  !T(En) ,  is  f*(En)  "T  (En  )  A  A1(En)  and  carries  the  corresponding 
dual  basis  (dx1,  • ■ • ,  dxn).  Elements  of  Ax(En)  are  called  1-forms  (differential 
forms  of  degree  l)  and  have  the  coordinate  respresentation 

n 

n(x)  =  \  n. (x)dx1. 

1  1 


I 
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h(x)  is  at  least  once  continuously  differentiable.  Then  applying  extended 


Kalman  theory  (EKT)  to  the  transformed  equations  gives  the  state  estimate 
dzt  =  az^dt  +  PtHt(dyt  “  hc(zt)dt) 

3h 


H 


if  <:t> 


(2.12) 


P,  =  2aPt  +  b2  -  Pj  «2 
The  approximation  enters  only  through  the  function  hc  in  this  example. 

Based  on  this,  one  might  expect  z^  to  be  a  better  approximation  to  the 
conditional  mean  estimate  E{zt|Fy}  than  the  corresponding  approximation  for 
ECxJ  F^}  developed  through  application  of  EKT  to  the  original  stochastic 
differential  equations  in  certain  cases.  This  conjecture  is  investigated  in 
Chapter  3  for  an  interesting  practical  example. 


Example  3  Consider  the  s.d.e. 

x  a  (x.)  (4  +  q2(x  ))q(x  ) 

dx  =  -t  ~  ..5-  (a-b2  - - -  )dt  +  b  - 

'  q+(xt)  <<*t>  W 


dw 


t 


where  q+  (x)  =  (1  ±  x2)  .  With  the  initial  state  value  |xo|  <1 

with  probability  1,  the  diffusion  process  {xt>  t  >_0}  has  as  its  state 

space  the  open  interval  (-1,1).  Applying  Theorem  1,  it  is  found  that  under 

(x)  =  -X—  =  u 
1-x2 

dUj.  =  au^dt  +  bdwt  . 


The  next  section  makes  use  of  a  2-dimensional  generalization  of  this 
example. 
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for  which  the  equivalent  Gauss-Markov  diffusion  would  be 

dz^  =  az^  dt  +  bdwt  (2.10) 

If  in  addition  the  observation  is  of  the  form 

dyt  =  ^t^1  +  3"  xt^dt  +  dvt  (2 •  11) 

and  z  =  x  (1  +  —  x2)  is  distributed  G(0,  a2)  then  the  optimal  filtering 
o  o  3  o  o 

problem  for  {xt»  t^Ojis  solved  by  the  Kalman-Bucy  equations  for  the  equivalent 

problem  (2.10)  and  (2.11).  The  conditional  density  is 

p(x,t|v  ,  0<  s<  t)  =  — (1  +  M-  exp  {-  y[x(l  +  j  x  )-  zt]2/P(t)} 

'S  “  (2lTP(t))1/2 

where  tzt»  P (t ) }  are  the  solution  of  the  Kalman  filter  for  (2.10)  and  (2.11). 
NOTE:  The  unappealing  Gaussian  initial  condition  is  easily  circumvented. 

The  work  of  Benes  and  Karatzas  (9a]  or  Makowski  [9b]  can  be  applied  to  handle 
arbitrary  initial  state  distributions. 

The  foregoing  example  is  rather  contrived;  it  seems  unlikely  in  practice 
that  one  would  encounter  a  problem  which  is  both  transformable  to  a  linear 
s.d.e.  and  has  just  the  right  observation  drift.  However,  in  the  next  example 
the  potential  application  of  the  transformation  theory  to  develop  approximate 
solutions  is  indicated.  Further  justification  for  why  one  might  use  this 
approach  is  postponed  until  Chapter  3. 


Example  2  Assume  the  state  equation  satisfies  the  conditions  of  Theorem  1  and 
has  the  equivalent  linear  representation  under  transformation,  z t  =  $(xt) 

dz^  =  az^dt  +  bdw^ 

Assume  the  observation  equation  is 
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.  »  l  U'V>M  ^  1  V ‘l  m 1  % »  VT  V ^  f  J i  i 


In  this  case  Che  C°°  transformation  relating  the  two  diffusions  is 


z 


<j>(x) 


b  Q(x) 
a  g(x) 


Remarks : 

The  theorem  as  stated  is  valid  only  for  a^O  and  thus  restricts  Sx  *=  It. 
Generalizing  slightly  by  using  a<f>(x)+c  on  the  r.h.s.  of  (2.7)  and  allowing 
a=0  admits  situations  like  Sx  =  E+  -  (0).  The  only  other  modification 
required  for  this  case  is  that  <j>(x)  is  constructed  by  solving  the  latter 
half  of  (2.6)  instead  of  (2.7).  This,  for  example,  covers  the  well 
known  case  of  a  bilinear  s.d.e. 


We  give  several  examples  of  the  use  of  Theorem  1. 


Example  1  Consider  the  diffusion 


Here, 


dx 


ax^l-fr^x2) 


b2x. 


(1  +  x2  ) 


(1  +  x2)3_, 


dt  + 


dw. 


(1  +  x*)  1 


Q(x)  =  ax 


(1  +  y  x2) 

(1  +  X2) 


(2.8) 


so 


g(x)  = 


(1  +  Xz) 


Q(x)g"(x)  -  g(x)Q'(x)  +  ag(x) 


ab  -  ab 
(1  +  x2) 

By  choosing  a=a  ,  Theorem  1  is  satisfied  and  the  transformation  is  given  by 


z  =  4> (x)  =  x(l  +  x2) 


(2.9) 
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- .v ,s  . . 


*  ■  i 


The  only  consistent  solution  for  this  equation  occurs  when  the  bracketed 
term  is  identically  zero.  Substituting  back  into  the  remaining  equation 
gives 

[fW  -  \  g(X)]  -  a<Kx)  (2.7) 

1  3b 

Define  Q(x)  *  f(x)  g(x)  and  notice  that 

(x)  Q  (x)  +  ||  (x)  ||  (x)  -  a  ||  (x). 

Using  these  it  follows  that 


||  (x)  {Q(x>  ||  (x)  -  H  (x)  g(x)}  -  -  a  ||  (x)g(x). 

Then  letting  [Q,  g](x)  -  Q(x)  |&  (x)  -  ||  (x)g(x), 

and  combining  with  (6)  we  have  the  necessary  and  sufficient  equation, 

(x){[Q.g](x)  +  ag(x)}  -  0  . 


The  desired  transformation  may  be  found  by  solving  (2.7)  and  using  (2.6): 

«x, 


The  equivalent  linear  s.d.e.  is 

dzt  =  d<J>(xt)  =  a<J>(xt)dt  +  bdwt 


Summarizing,  we  have: 


=  az. 


dt  +  bdw^ 


THEOREM  1  The  C°°  non-degenerate  diffusion  on  F,  specified  by  the  s.d.e. 
dxt  =  f(xt)dt  +  g(xt)dwt>  is  equivalent  to  a  linear  s.d.e.  on  K  of  the  form 
dzt  =  az^dt  +  bdwt  ,  (a  ^  0) 


if  and  only  if 

(i)  |g(x))  >  0 

(ii)  Q(x)  =  f  (x)  -  j|f^- 

?&.(*)  z(xy^ 


g(x),  Q (0) 


+  a  g(x)=0 


=  0 

V  x  e  R 


( iii)  Q(x) 


2.  SCALAR  DIFFUSIONS 


The  general  scalar  diffusion  model  of  interest  is 

dxt  =  f(xt)dt  +  g(xt)dwt.  (2.3) 

Heretw^,  t^>0}  is  a  standard  Brownian  Motion  on  R  and  the  functions  f(x), 
g(x)  satisfy  the  conditions  stated  in  Section  1.  The  state  space,  Sx,  is 
also  assumed  to  be  an  open  set  in  R.  The  reason  for  this  restriction  is  to 
ensure  that  (f(x),  g(x)),xe  interior  (S^)  completely  determine  the  characteristic 
operator  of  the  diffusion  {x  ,t>0},X,  at  every  point  in  S  .  In  simpler 

t  X 

terms,  this  assumption  excludes  all  but  natural  boundaries  from  the  state 
space.  More  will  be  said  about  this  in  the  remarks  to  follow. 

What  is  sought  is  a  smooth  mapping  <J>,  taking  x^  into  in  such  a  way 
that  the  s.d.e.  corresponding  to  2.3  is  linear  and  of  the  form 

dz^  »  azt  dt  +  bdwfc  (2.4) 

where  a,b  are  constants  and  a?*0.  Assuming  that  the  transformation  is  at 

least  twice  continuously  differentiable,  the  Ito  differential  rule  gives 

,  92<j)(x )  -j  d<p(x) 

d*(xt)  ■  [_— sr-  f  <*t> + 1  — ,  8  (xt>  dt  +  -5T"  8<xt)d”t  <2-5) 

Letting  z ^  0 ( x t )  and  equating  corresponding  terras  in  (2.4)  and  (2.5)  yields 


?<Kx)  .  az4>(x) 

-^T~f(x)  +  I  Ijr-  s2<x>  = 


9Q(x) 

dx 


g(x) 


=  b 


(2.6) 


for  all  x  e  S  . 

x 

To  solve  these  simultaneous  equations,  first  note  that  due  to  the  second 


part  of  (6), 


_9_ 

9x 


3<ft(x) 
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5  discusses  two  further  aspects  of  the  transformation  idea.  The  first  is  the 
targeting  of  nonlinear  s.d.e.'s  in  the  transformation  by  way  of  an  example 
using  the  Benes  state  equation.  The  second  topic  deals  with  mapping  nonlinear 
s.d.e.'s  to  higher  state  dimension  linear  representations  within  the  context 
of  the  material  in  Section  3.  The  final  section  summarizes  the  main  results 


of  the  chapter. 


n 


to  a  linear  stochastic  differential  equation.  In  so  doing,  this  work  is  a 
direct  extension  of  the  ideas,  but  more  importantly  a  simplification  of  the 
methods,  in  [12,22]  to  nonlinear  stochastic  differential  equations. 

In  the  remainder  of  this  chapter,  the  class  of  problems  defined  by 
nonlinear  s.d.e.s  which  can  be  pointwise  transformed  into  an  equivalent 
version  in  the  class  of  linear  s.d.e.s  is  characterized  in  detail.  A  proce¬ 
dure  for  constructing  the  required  transformation  and  for  recognizing  when  a 
particular  nonlinear  s.d.e.  is  amenable  to  this  transformation  are  determined. 

In  the  next  section,  the  1-dlmensional  problem  is  discussed  to  delineate 
the  background  ideas  involved  in  the  more  difficult  n-dimensional  cases.  The 
existence  of  an  equivalent  linear  representation  for  a  scalar  nonlinear 
s.d.e.  is  characterized  in  terms  of  a  certain  ordinary  differential  equation. 
This  easy  and  straightforward  case  is  presented  first  because  of  its  expository 
value  for  the  uninitiated  reader.  Further,  this  discussion  allows  a  convenient 
introduction  of  both  an  (admittedly  contrived)  optimal  filtering  application 
and  a  potentially  useful  approximate  filtering  application  of  the  general 
transformation  theory.  Section  3  addresses  the  transformation  of  n-dimensional 
nonlinear  s.d.e.'s.  Tools  from  Geometry,  which  are  used  in  the  n-dimensional 
setting,  are  summarized  first.  These  tools  are  employed  to  describe 
necessary  and  sufficient  conditions  (similar  to  those  in  [12])  a  nonlinear 
s.d.e.  must  satisfy  to  admit  transformation  to  a  linear  s.d.e.  target 
representation.  Simplified  practical  tests  are  devised  for  determining 
existance  of  the  necessary  transformation  of  state  coordinates,  and  a  simple 
constructive  method  is  provided  for  determining  this  transformation.  Examples 
are  also  given.  Section  4  presents  two  straight  forward  extensions  of  the 
results  of  previous  sections  to  raulti-input/raulti-output  and  time  varying 
stochastic  differential  equations.  Examples  of  each  are  discussed.  Section 


The  standard  problem  In  nonlinear  filtering  is:  given  (2.1)  and  an 
observed  process  (yt»  t^O) ,  related  to  (xt»  t^O}by  the  s.d.e. 

dyt  «  h(xt)dt  +  dvt  ,  (2.2) 

where  {y^,  t  >  0}  e  E^,  h:En  -*•  E**  is  Borel  measurable  and  {vt,  t  >  0}  is  vector 
valued  Brownian  Motion,  construct  an  estimate  of  {xt»  t^O}  given  (ys>  O^s^t}. 
Optimal  filtering  solutions,  in  the  mean  square  error  sense,  have  been 
constructed  in  a  few  cases:  Gauss-Markov  case  (Kalman-Bucy  theory);  the  Benes 
problem,  where  f(x)  satisfies  a  Riccati  differential  equation,  g(x)  is  a 
constant  and  h(x)  is  linear  (not  affine)  in  x  [2,3];  conditionally  Guassian 
processes  [4];  discrete  state  Markov  problems  [5];  and  the  multidimensional 
cases  constucted  by  W.  Wong  [6],  to  name  several.  Approximate  filtering 
solutions  for  other  situations  are  somewhat  limited  and  include  the  extended 
Kalman  techniques  and  a  handful  of  series  expansion  (or  related)  approaches 
when  "small"  nonlinearities  are  involved  (see  e.g. ,  [19],  [20]  and  [21]). 

Brockett  [7],  Baras  [9]  and  others  [22]  have  studied  the  idea  of 
equivalence  classes  of  filtering  problems  by  considering  guage  transformations 
pointwise  state  transformations,  and  what  Baras  has  called  Group  Invariance 
methods.  The  concept  here  is  to  identify  when  a  given  problem,  equations 
(2.1)  and  (2.2),  or  (2.1)  alone,  is  merely  a  disguised  version  of  another 
perhaps  better  understood  problem.  Similar  notions  have  been  discussed  in 
the  deterministic  control  systems  literature  as  far  back  as  1973  by  Krener 
[22]  and  more  recently  by  Su  [12],  and  Hunt,  Su  and  Meyer  [13]. 

Motivated  by  these  equivalence  ideas  and  by  a  desire  to  obtain  alterna¬ 
tives  to  standard  extended  Kalman  filters,  this  report  develops  a  set  of 
constructive  tests  through  which  it  can  be  determined  when  a  diffusion  satis¬ 
fying  the  s.d.e.  (2.1)  can  be  transformed  pointwise  to  another,  possibly  more 
useful  form.  In  particular,  the  bulk  of  the  report  deals  with  transformation 


CHAPTER  2 


TRANSFORMATION  THEORY  FOR  NONLINEAR  STOCHASTIC 
DIFFERENTIAL  EQUATIONS  ON  En 

1.  INTRODUCTION 

Nonlinear  filtering  problems  frequently  model  the  “signal”  as  a  diffusion 

process  satisfying  a  stochastic  differential  equation  (s.d.e.)  of  the  form 

dxt  -  f(xt)dt  +  g(xt)dwt  ,  (2.1) 

where  {wt,t^0}is  a  standard  Brownian  Motion  on  Rm,  m^>l,  and  {xt,t>^0} 
e  Kn,n>m.  g(x)  -  (g*j(x))  is  a  Borel  measurable  (in  general  matrix  valued) 
function  (x)e  Rn-»- Kn  ®  Em  and  f(x)  “  (f^(x))  is  a  Borel  measurable  (vector 
valued)  function  (x)  e  En  ->-En.  In  this  case,  the  s.d.e.  (2.1)  is  said  to 

be  of  Markovian  type  [1].  A  s.d.e.  thus  generalizes  the  notion  of  an  ordinary 
differential  equation  by  adding  the  effects  of  random  fluctuation. 

For  the  purposes  of  this  report,  the  drift  (f(x))  and  diffusion  (g(x)) 
coefficients,  and  the  process  defined  by  (2.1)  are  restricted  further: 

(i)  f,g  are  assumed  to  be  continuously  differentiable  (C°°)  functions  of 
their  arguments. 

(ii)  we  specifically  exclude  those  s.d.e.'s  in  which  g(x)  **  0  for  some  x 
in  the  state  space,  Sx. 

The  first  restriction  ensures  that  the  needed  differentiability  of  f  and  g  is 
always  available  and  allows  one  to  avoid  the  tedium  of  keeping  track  of 
the  degree  of  differentiation.  The  second  restriction  voids  degenerate 
stochastic  differential  equations  and  thereby  guarantees  that  the  equations 
we  wish  to  solve  (see  below)  do  not  exhibit  explosions.  Diffusion  processes 
governed  by  s.d.e.'s  satisfying  these  conditions  will  henceforth  be  referred 
to  as  Cro  non-degenerate  diffusions. 
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If  w  =  da  then  us  is  Exact 


(i)  Poincares  Lemma:  d(doo)  =  0,  weAP( Rn)v  p  <  dim  r(En) 


Linear  Equivalence  Transformations  in  n-Dimensions 


In  the  spirit  of  Section  II,  and  using  the  tools  of  the  previous  paragraph, 


we  now  consider  the  stochastic  differential  equation, 

dxt  =  f(xt)dt  +  g(*t)dV  ^Xt’  t-0,  X  £  SxC  • 


(2.16) 


where  we  assume  Sx  is  a  open,  simply  connected  subset  of  K  . 

The  question  we  ask  is:  are  there  diffusions  of  the  type  (2.16)  which 


are  equivalent  (C°°  related)  to  a  diffusion  of  the  type 

dzt  =  Az^dt  +  dwt>  {zt,  t  >;0,  z  e  »n)  . 


(2.17) 


Krener  [23],  Su  [12]  and  Hunt,  Su  and  Meyer  [13]  have  answered  related  questions 

for  deterministic  nonlinear  differential  equations  of  the  form  x  =  f(x)  +  g(x)u(t) 

By  properly  interpreting  the  relation  between  (2.16)  and  the  vector  fields 

defined  by  the  drift  and  diffusion  terras  we  can  extend  their  results  to  the 

s.d.e.  case.  We  proceed  as  follows: 

We  seek  a  transformation  <p:  ■+  R  which  is  sufficiently  smooth  (C°°) 

n  ~ 

so  that  z^  =  4>(xt)  is  a  diffusion  on  R  and  satisfies  Ito's  rule.  That  is 


.  \  .  9773x7  gi(x)gj(x)}  dt  + 
i»j=i  j  i 


(2.18) 


{  l  977  8i(x)  dwt} 


and  under  which  we  have  the  equivalent  linear  stochastic  differential  equation 


dzt  =  Az^dt  +  bdw^ 


(2.19) 


for  some  (nxn)  matrix  A  and  (nxl)  vector  b.  We  make  the  following  assumptions: 
r()(0)  =0,  <f>  is  1-1  and  invertible,  <f>  is  C°°  on  Sx  and  the  pair  (A,b)  is 
controllable  (in  the  usual  sense  of  time  invariant  linear  systems). 


.  >w..7.  *u . , -  Sc..,.  ,,v.7  i  j  **a*»U*.. 


-19- 


Now  note  the  following  result.  If  <J)  exists  such  that  equations  (2,18) 
and  (2.19)  are  equivalent  descriptions  of  the  diffusion  (z^,  t^O)  then  for 
m  -  lj  2y  « « •  |  n 


n 


n 


l 

i=l 


d2<p 

Tm 

3x.3x.  ®i 
J  i 


n  n 


-  -  X  l 

j=l  i=l 


30  3g , 

Ym  6i 

3x .  3x .  ® j 
i  J 


(2.20) 


This  is  obtained  by  equating  the  diffusion  terms  in  (2.18)  and  (2.19),  differen¬ 


tiating  with  respect  to  x j ,  multipling  by  gj  and  summing.  If  we  interpret 

g(x)  as  a  vector  field  over  >  then  in  terms  of  coordinates 

n  3g 


dg(x)  =  l  l 


j-i  i-i3xi 


j  i  .  3  . 

-  (dx  A  )  , 


3x , 


which  is  easily  Interpreted  as  a  linear  functional  on  T(S  ).  That  Is,  dg 
acts  on  2'(Sx)  by  way  of  the  pairing  <•••>  .  For  example. 


<dg,  g)  =:  <§f)g 


3e  “  “  98i 

and  (g^)g  =  I  <  l 


3x  gP  W-  €  r(Sx} 
i-i  j-1  j  2  i 


(2.21) 


Using  (2.20)  and  (2.21)  and  equating  terms  in  (2.18)  and  (2.19),  we  arrive  at 

the  conditions  for  equivalence  (using  exterior  differentiation,  d) : 
let  0  =  (<J>  ,  **•>  4>n)  then 


^(x),  Q(x))  =  l  am.  4>i(x) 
(— ^m(x)  ’  8(X))  =  bm 


(2.22) 


where  A  =  (a^)^,  b=(b^ ,... ,bn)^,  prime  denoting  transpose,  and  Q(x)  signifies 


the  vector  field  induced  by 


Q(x)  =  f(x)  -  j  ^£0  g(x) . 


-20- 


Remarks : 


(1)  Equations  (2.22)  are  the  linearization  conditions  obtained  for  the 
deterministic  differential  equation  x  =  Q(x)  +  g(x)  u(t) 

(2)  The  functions  Q(x)  =  f(x)  -  y  g(x)  are  precisely  of  the  form 

required  to  associate  the  s.d.e.  (2.16)  with  its  induced  vector 
fields  on  !T(Sx)  [14].  As  is  well  known,  Q(x)  represents  the  vector 
field  induced  by  the  Stratonovich  form  of  Eqn.  (2.16). 


Necessary  and  Sufficient  Conditions  for  Local  Equivalence 

To  solve  the  simultaneous  partial  differential  equations  (22)  easily,  we 
need  to  apply  the  Frobenius  Theorem  [10].  To  make  use  of  this  theorem  we 
need  assume  that  the  pair  (A,b)  is  controllable. 

Following  the  procedure  of  Section  II,  equations  (2.22)  in  matrix  form  are 

(dcf>,  =  A  <J> 


^d<[>,  g)  =  b 


(2.23) 


Defining  the  Lie  derivative  by  L^(g)  =  [Q.g]  where  [*»*]  is  the  Lie  bracket. 


and  then  iteratively 


Lq(8)  =  [Q.  L“  \g)] 


Lq(g)  =  g 


m 


0,  1,  . 


and  imitating  the  method  of  Su  [12]  we  obtain  the  (necessary  and  sufficient) 

equations  for  equivalence  as, 

^d4>,  L(Q,  g)^  =  C(-A,b)  .  (2.24) 

Here  C(-A,b)  is  the  controllability  matrix  of  the  pair  (-A,b),  L(Q,g)  is  the 
matrix  whose  columns  are  given  by  the  vector  fields  (Lq(g)»  •••,  *(g)), 

d4>  Is  the  vector  of  1-forms  (d4>j>  d_4>2 ,  ••*,  d4>n) '  and  we  have  generalized  the 
natural  pairing,  <•••)  , to  these  "vectors"  in  the  obvious  way.  In  more  explicit 


notation,  let  x  =  (x^,  ...»  x^)  be  a  Euclidean  coordinate  system  on  Sx  and 

dx  =  (dx^  ,  dx2 ,  •  •  • ,  dxn)  ' 

3.3  3  v 

as-  -  <-sr-  "  •  ir> 

1  n 

then^dx,  =  (^j)  =  I»  the  nxn  identity  matrix  and  hence 

(f£)  dx«  ^  L  =  C(-A,  b) 


(!*)  = 
('3xJ 


(2.25) 


8r  (Q,g]1 


gn,  [Q.gln 


[Q.Lg  1(g>l1 


[Q.L^1(g)ln 


The  main  integrability  result  of  Differential  Geometry,  the  Frobenius 
Theorem,  is  noted  for  completeness: 


THEOREM  2  Let  D  be  a  "differential  system"  of  dimension  p  on  an  n- 

dimensional  differentiable  manifold  M.  Let  the  vector  fields  {X.  ,  •••,  X  } 

i  n 

be  linearly  independent  and  let  V(D)  be  the  set  of  all  vector  fields,  X, 
such  that  X(r)  £  D(r)  V  r  e  M.  Then  D  is  completely  integrable  if  and 
only  if  V(D)  is  a  Lie  Algebra  over  F(M).  Thus  if  {X^ ,  X  }  span 


.*•  .  V-'.vV  ' 
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V(D)  on  a  neighborhood  U,  then  D  is  completely  integrable  if  and  only  if 
there  are  (C00)  functions  C*  :U-*-R  such  that 

jk 

P 

(V  Xj]  “  l  Cjkxi»  11  j.  k<p  (2.26) 

In  [12],  this  is  applied  to  the  deterministic  linearization  problem  under 
feedback  to  show  the  existance  of  a  dif f eomorphism  when, 

(1)  <8.  L*(g).  •••.  are  linearly  independent  in  T(SX)  in  a  neighborhood 

of  the  origin,  U. 

(ii)  Span{[L™(g),  L*(g)],  0£m,  j£n-2}  is  a  Lie  Algebra  over  F(SX)  in  T(SX) 
with  {[g,***,  l”~2 (g) }  in  U. 

(iii)  Q(o)=0 

In  the  present  setting,  a  linear  s.d.e.  representation  for  (2.16)  exists 
(locally)  if  the  above  conditions  hold.  Obviously,  if  these  conditions  hold 
on  all  of  T(SX)  then  the  equivalence  is  global. 

In  addition  to  the  Frobenlus  Theorem,  it  is  shown  in  [13]  that  a 
sufficient  condition  for  <p  to  be  a  global  transformation  providing  equivalence 
is, 

(iv)  the  Jacobian  of  <p  satisfies  the  (noncharacteristic  matrix)  ratio 
conditions  on  Bn. 

This  assures  that  <J>:  -*■  Rn  is  1-1  on  all  of  Kn.  The  proof  of  the  sufficiency 

of  this  condition  is  given  in  [13]. 

The  necessary  and  sufficient  condition  (ii)  is  cumbersome  to  verify, 
when  n  is  large.  A  more  practical  test  for  this  is  available  by  using  a  version 
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of  the  Frobenius  theorem  stated  in  terms  of  the  1-forms  dual  to  the  set 

L(Q,g).  This  is  motivated  in  the  discussion  which  follows. 

Assume  L(Q,g)  is  a  linearly  independent  set  of  vector  fields  on  some 
r  1  n^ 

UcS^.  Let  wL/  be  the  first  dual  to  this  set.  Then 

(a)  u)  is  a  vector  whose  components  are  1-forms  in  AX(S  ) 

L  x 

3  3  In 

(b)  Since  a  dual  basis  to  {-g^y,***»  ^  on  ^x  *  ***>  dx  K 

wT  spans  U*  <=  A1(S  ) 

L  x 


(c)  Using  the  matrix  notation  of  (2.25),  if 

L(Q.g)  -  (L*  (2.27) 

then  we  have  the  representation: 

=  (L  *)dx 

(d)  Since  L(Q,g)  span  U^,  L  *  exists  (L  is  non-singular  on  all  of  U). 

(e)  Equation  (2.24)  is  equivalent  to  the  equation 


d<}>  *  C(-A,  b) u) 


(2.28) 


Remarks : 


(1)  Equation  (2.28)  is  a  set  of  coupled  partial  differential  equations  for 


<)>(x)  (°n  U).  Our  goal  is  to  find  a  solution  of  the  form, 

<Kx)  -  <j>(0)  =  /  C(-A,b)u>.  • 

[o,x]  CU 


(2.29) 


This  is  a  standard  problem  in  the  integration  of  differential  forms  on 
the  manifold  Rn  (or  Sx) . 

(2)  Because  (A,b)  is  controllable,  (2.19)  can  be  chosen  as  a  controller  form. 
Then  C(-A,b)  is  upper  triangular  and  $n(x)  will  depend  only  on 
Then  remaining  components,  j  can  be  built  up  from  this  solution. 

Hence  the  n  equations  can  be  solved  one  at  a  time,  starting  at  n  and 
working  downwards,  similar  to  the  procedure  in  [12],  As  will  be  seen  in 
the  sequal  this  is  totally  unnecessary  as  (2.29)  can  be  integrated 
directly. 
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(3)  If  to£  were  exact  then  it  is  clear  that  <P  would  exist  and  equation  (2.28) 

solved  by  solving  the  related  equations”  =  d0,  where  0  is  a  o-form.  By 

Poincare's  lemma,  one  would  like  to  say  that  if  dto”  =  0  then  a  0  £  A0  (S  ) 

L  X 

such  that 

<f>(x)  =  C(-A,b){0(x)  -  0(0)}  +  <J>(0) 

This  is  in  fact  true  globally  on  Sx  (see  the  version  of  the  converse  to 
Poincare's  lemma  below).  This  leads  us  to  the  result  stated  in  Theorem  4 
below. 

First  we  state  a  version  of  Frobenius  Theorem  in  line  with  the  kind  of  result 
we  are  looking  for. 


THEOREM  3  Let  D  be  the  differential  system  of  dimension  p  on  the  n- 
dimensional  manifold  Sx  as  before.  Let  to^  =  (to^,  •••,  to”}  be  the  dual  to 

L(Q,g)  on  U  then  let  1(D)  »  (set  of  differential  forms  ft[Q(r)} vanishes 

n 
°L 


on  D (r)  V  r  e  S  }.  Hence,  on  U,  to^+1 ,  w”  generate  1(D)  and  D  is  com- 

X 


pletely  integrable  if  there  exist  C°°  functions  C7,  on  Sv  such  that 

jk 

*4  "  ^  C1kWL  A  WL 

p<j<k  L 


(2.30) 


Proof:  (see  Sternberg  [10])  On  U*,  since  L(Q,g)  is  a  linearly  independent 
set,  we  have  uniquely  the  dual  1-forms  (a  basis  on  A1(S  )),  {coT,***,  to”}  . 
On  U* ,  1(D)  is  generated  by  {oo£  .  ***»  w^}  •  As  in  Theorem  2,  {X^ ,  •••,  X^} 
span  D.  We  have,  using  the  relation  from  our  preliminaries, 

(XiAKjli<\)  ■  xi  (Xjl“i)-  Xj{XiI“l)  "  (|xi'  XJ!|“l)  (2-31) 

Taking  i,  j<p,  k>p,  we  have  by  applying  definition  (2.15), 

(X±AX  |duj)-  -  ^[Xi,  XjUuf)  (2.23) 

and  this  vanishes  if  (X^,  X j ]  satisfies  (2.26)  hence  the  result. 
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Specializing  this  theorem  to  the  problem  at  hand  we  have: 


THEOREM  4  The  nondegenerate  diffusion  withSx  =  and  completely 

specified  by  the  s.d.e.,  dxt  =  f(xt)dt  +  g(xt)dwt>  where  f,  g  are  C°°, 
have  a  controllable,  linear  s.d.e.  representation  (on  Rn  )  of  the  form, 

dzt  =  Az^dt  +  bdwt 
if 

(i)  Q(0)**0 

(ii)  L(Q,g)  is  a  linearly  independent  set  of  vector  fields  on  a 
neighborhood  of  the  origin,  Uc  Sx> 

(iii)  the  dual  set  of  one  forms  to  L(Q,g)  given  by  -  K,-,  ,  has 

do)^  =  0  on  all  of  U  (i.e.,  is  closed). 


Proof:  Using  the  controllability  assumption,  pick  a  controller  form  for 
(A,b)  then  apply  Theorem  3. 

Remarks : 

Global  validity  of  the  transformation,  <J>(x)  ,  dealt  with  in  [13] 
through  study  of  the  Jacobian  of  4>  (x)  and  the  ratio  condition  on  the  non¬ 
characteristic  matrix;  in  essence,  determining  when  4>  is  1-1  on  Hn.  A 
more  succinct  approach  is  argued  as  follows: 

For  <j>(x)  to  exist,  with  arbitrary  controllable  pair  (A,b),  a  necessary 
condition  is  that  w.  be  a  set  of  closed  1-forms  (i.e.,  dot  =  0). 

The  integrability  question  posed  by  (2.29)  is  then  the  same  as  the 


question:  what  conditions  on  (Q,g)  assure  that 

n  30 

w.n  =  d0  =  l  dxJ 
L  —  n  "  3x. 

i  J 

for  some  0^:  Rn  ■+H  (i.e.,  that  is  exact)?  When  L(Q,g)  are  linearly 

independent  on  the  whole  of  Rn,  hence  is  a  linearly  independent  set 
everywhere,  then  a  global  equivalence  exists  whenever  duL1  = 0  by 


nrjr.'g.  tf,Mw  w 


»*  '  if  -JT'V"  W'mw  .  ■  t  Mfg  ■  »,i  »,"i  i  i  y  •'  ^  ■  'JP  '  <r .  /■ .  ^ 


virtue  of  the  converse  to  Poincare's  lemma  (see  Flanders  [11],  chapter  3). 
This  gives  the  aforementioned  simpler  test  in  the  following  result. 


t 


D 

a 


Corollary  1:  Let  the  process  {x^,  t>_0},  satisfying  (2.16),  have  state 
space  Sx  dif f eomorphic  to  Rn. 

Suppose  L(Q,g)  is  .a  linearly  independent  set  on  all  of  T(SX) 
and  let  denote  the  dual  to  this  set.  Then  a  necessary  and  sufficient 
condition  for  the  global  equivalence  of  the  s.d.e.s  (2.16)  and  (2.17)  is 
that  daL  =  0  on  all  of  A2(S  )  (i.e.,  for  all  x  e  S  and  any  XeT(S  )). 

Proof :  Because  Sx  is  dif f eomorphic  to  Rn,  note  that  Sx  has  a  global 
chart,  say  (p,  U) .  Using  the  proof  of  Flanders  for  the  lemma  concerning 
the  exactness  of  a  p-form  on  a  domain  U,  note  that  Rn  can  be  deformed  to 
a  point  (OeRn)  by  t(s,  x1,  x11)  =  (sx1,  sx^,  ••*,  sx11)  • 

n  ^  i 

Using  the  Induced  mapping  of  forms,  T*.  and  u"  *  |  a  (x  »••*,  xn)dx  ,  we  have 

L  1  i 

T*(“f)  =  l  a  (sx1,  •••,  sxn)d(sxX) 

L  j 

=  Y  a.  (sx1,***,  sxn)  (sdx1  +  x  ds)  ,  s£l,  I  =  [0,  1] 

1  1 

Now  define  the  mapping  K:AX(I  x  Rn)  A0  (Rn)  by 
K(a(s  .xjdx"1 )  =  0 
l 

K(a(s,x)ds)  =  J  a(s,x)ds 
o 

for  all  "smooth"  functions  a:  I  x  Rn  ■+■  R  . 

So 

K(T*  oj")  =  l  J  xia(sx1  ,  •  •  • ,  sxn)ds 
L  i=l  o 


-27- 


,v.v: 
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Remarks:  (1)  This  corollary  gives  a  simple  constructive  test  for  the 

existance  of  a  linear  s.d.e.  representation,  either  globally  or  locally 
depending  on  the  domain  on  which  L(Q,g)  is  non-singular.  There  does  not 
appear  to  be  any  simple  conversion  of  this  condition  to  one  directly 
upon  (Q,g).  It  is,  however,  straightforward  once  one  verifies  the 
linear  independence  of  L(Q,g).  Furthermore,  the  global  result  may 
breakdown  if  {x^,  t ^>0}  is  a  diffusion  restricted  to  a  manifold  other 
than  one  globally  dif  f eomorphic  to  B.n  (e.g.,  the  unit  sphere  Sn  c:  "Rn)  . 
The  problem  here  is  that  one  really  needs  a  global  chart  or  coordinate 
system  to  make  it  work  globally.  The  technical  point  being  that  the 
converse  to  Poincare's  lemma  fails  except  on  such  spaces;  one  then  needs 
to  consider  the  cohomology  of  1-forms  on  Sx.  Hence,  the  state  space 
Sx  must  be  globally  dif feomorphic  to  Rn,  possessing  only  natural 
boundaries.  In  particular,  Sx  must  be  a  simply  connected  open  subset 
of  Rn.  Otherwise,  Theorem  3  can  only  be  asserted  in  a  open  neighborhood 
of  the  origin;  in  geometric  language  on  a  single  coordinate  neighborhood 
of  a  chart.  With  the  restriction  that  Sx  be  dif feomorphic  to  Rn  and 
has  a  single  global  chart  then  we  need  not  specify  in  Theorem  4. 
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(2)  If  Sx  is  only  locally  dif feomorphic  to  Rn  (i.e.,  is  a  general 

differentiable  manifold)  then  Theorem  4  is  applied  to  each  chart  in 

the  open  covering  of  as  a  C°°-manifold .  Thus,  if  the  charts  are 

labeled  (u  ,  U  ) ,  we  must  find  a  corresponding  4>  =  6op  on  U  . 

a  a  T  a  a 

Piecing  together  these  4*^,  if  they  exist,  an  equivalent  linear  s.d.e. 
can  be  found  for  each  nonlinear  s.d.e.  representation  of  the  diffusion 
on  Sx-  It  may  also  turn  out  that  a  linear  s.d.e.  representation  will 
exist  only  on  a  subset  of  all  charts  in  the  manifold  covering. 


EXAMPLES 


(1)  Let  (xt,  yt,  t>0)  be  a  2-dimensional  process  satisfying  the  s.d.e. 


d  (  t) 
yt 


ax  +  6  y  e  yt  -  x2  eyt 

-e-yt~ 

t  L  t 

x  eyt 

dt  + 

0 

t 

dw^  . 


Applying  Theorem  4 


Q(x,y) 


L(Q,g) 


ocx  +  6  y  e  y  -  x2ey 

.y 


xe' 


e  y  e  y(xey  -  a) 


-1 


det  L(Q,g)  =  e 


r^y 

(xey  -  a) 

II 

3 

L  0 

-1 

II 

K  J 

and 

'  0 

i“V  ■ 


(ey  -  ey )dy  A  dx  =  0 . 


r  y  “i 

xey 

<p(x,y )  *  C(-A,b)  J  =  c  (-A,b)  _  _  . 


y  t* 

Then  under  the  transformation  (ut ,  v^)  =  (x^  e  ,  y^)  we  have  the  equivalent 


linear  representation. 


a  6l  ru  ~|  F 1 

,  .  dt  +  _  dw 

i  oj  vj  L°J  t 


Also,  although  clearly  unlikely,  if  the  accompanying  observation  were 


of  the  form 


s  *  (cl  xt  eyt  +  c2  yt)dt  4-  dvfc  , 


then  the  conditional  density  for  the  process  (xt,  yt)  given  (zs»  0 <  s <  t} 
could  be  determined  from  a  Kalman  filter  for  the  linear  representation 
of  the  problem. 

(2)  An  interesting  example  is  obtained  by  generalizing  example  3  of  Section 


2  to  2-dimensions. 


Let  q+(x,y)  =  (1  ±  x2  ±  y2)  and  set 


f (x,y)  = 


q  (x.g)  -  4x2  q2(x,y)  q,(x,y)  +  1 

y(— — 7 - r - )  +  — 7 - r  (4x2  (— — — 7 - r - 3)) 

q+(x,y)  q+(x,y)  q+(x,y) 

q  (x,y)  -  4y2  y  q2(x,y) 

X(  '  q+(x,y) - >  +  4q+(x,y)  ‘  4*  > 


q+(x,y)  -  2x‘ 

q+(x,y) 


q  (x.y) 


;(x,y)  = 


q_(x,y) 

q+(x,y) 
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t 

yt 

=  f(xt,  yt)dt  +  g(xt,  yt)dwt 

,  {wt  e  R}  , 

5 

with  initial  condition  x2  +  y2  <  1  ,  with  probability  1,  defines  a  diffusion 
process  on  the  unit  disc,  D2  =  {(x,y)|x2  +  y2  <  1}  for  all  t>0. 


Since  D2  is  dif fecmorphic  to  R2  one  would  suspect  that  this  process 
may  have  an  equivalent  linear  s.d.e.  description  on  R2 .  Indeed,  after 
very  tedious  computations,  Theorem  4  is  verified  such  that  dco^Cx.y)  =  0 


V(x,y)eD2  and  the  desired  transformation  is  found  to  be 

x 

u  .  _  .  _ 

VI  I  I  q  (x,y) 


analogous  to  example  3  of  section  2.  Under  this  transformation  the  linear 


4.  STANDARD  GENERALIZATIONS 


Multi-Input  Models 

The  counterpart  to  Theorem  4  for  multiple  independent  Brownian  motion 
driving  terms  follows  directly  from  standard  developments  of  multivariable 

linear  system  theory.  The  nonlinear  s.d.e's  to  be  dealt  with  have  the  form 

m 

dxt  =  f(xt)dt  +  l  gX(xt)dwit  ;  (2.33) 

i=l 

{wit,  t>0}  e  R,  i-1,  •••,  m  (m  <  n)  are  mutually  statistically  independent, 
standard  Brownian  motions.  f(x),  gi(x),  ...,  gra(x),  are  assumed  C00,  Borel 
measurable  mappings  (x)  E  Rn~*  Rn  and  the  diffusion  process  (x^,  t  >  0}  e  Rn 
satisfying  (2.33)  is  assumed  non-degenerate  with  only  natural  boundaries. 

Again,  as  in  the  previous  sections,  the  intent  is  to  determine  when  a 
smooth  mapping,  T,  exists  which  carries  the  nonlinear  representation  (2.33) 
into  an  equivalent  linear,  time-invarient  s.d.e.  representation: 

d7t  =  Aztdt  +  BdWt  (2.34) 

where  wt  represents  the  (column)  vector  value  Brownian  motion  with  components 
(wit,  •••>  wmt)*  8  =  (81>  •••»  bm)  is  an  n  x  m  matrix  of  constants  representing 
the  diffusion  constants  of  the  equivalent  representation. 

Because  of  the  lack  of  a  single  unique  choice  for  a  linear  realization 
in  multi-input,  multi-output  systems,  we  need  to  choose  a  "preferred"  form 
for  the  linear  target  pair  (A,B).  To  make  things  concrete  we  choose  the 
Popov  form  for  (A,B),  assuming  of  course  that  this  realization  is  controllable. 
Also,  the  earlier  remarks  concerning  the  state  space,  Sx,  of  the  nonlinear 
representation  of  the  processes  {x  ,  t^>0}  carry  over  to  this  situation  as 
well  so  there  is  no  real  restriction  to  assuming  =  Rn. 
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where  vlt,  v2t  are  statistically  independent  Brownian  motions  (standardized)  as 
are  wlt,  w2t.  The  envelope,  rt,  is  a  true  envelope  process  in  as  much  as, 
rt  >  0 V  t  with  Prob.  1  if  tq  >  U  with  Prob.  1. 

The  phase  process,  0 t ,  is  a  "3rd  order"  process:  we  model  the  “acceleration”  in 
phase  as  white  noise.  This  is  well  known  to  be  a  very  reasonable  model  for 
uncertain  phase  in  many  applications  without  further  apriori  information. 

(See  [1].)  Indeed,  under  any  reasonable  physical  interpretation  the  fluctuation 
in  phase  and  frequency  (i.e.,  uncertainty)  is  well  modeled  to  first  order  as 
linearly  dependent  upon  a  small  underlying  random  “phase  acceleration". 

With  respect  to  the  choice  of  model  for  the  envelope,  we  remark  that  the 
bilinear  form  we  propose  allows  a  relatively  wide  variety  of  intuitively  pleasing 
cases  by  varying  the  model  parameters  a  and  6  ,  which  are  viewed  as  fixed 
constants.  For  example  a  slowly  fading  or  growing  amplitude  effect  can  be  achieved 
(representing  an  advancing  or  receding  source),  as  well  as  a  constant  power  signal 
by  virtue  of  the  relation  [3]: 

E  {  r2(t)  }  =  E{r2}  exp  (2  (a-1/ 2  02)t  +  2B2E{  (vlt)  2  }) 

=  E{r’}  exp  (2  (X  +  02)t) 

This  model  also  has  another  motivation  which  comes  from  the  applicability  of  both 

the  linear  equivalence  transformation  theory  and  the  IQ  looked  loop  (IQLL)  approach. 

The  appendix  outlines  the  transformation  theory  and  indicates  its  application 

to  (3).  Here  we  simply  remark  that  the  envelope,  r(t),  has  the  representation 

(z  +  A  ) 

t  t 

rt  =  rQ  e  ,  rQ  >  o  with  probability  1 

where  2  c 

A  =  (t  -  j  0  and  Zj-  =  j^ixlvls 

By  ito's  ruLe  it  is  clear  that  drt  =  irt^t  +  hrcdvxt 
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2 .  Signal  &  Noise  Model 

A  variety  of  applications,  such  as  communications,  radar  and  sonar,  deal 
extensively  with  a  signal-in-noise  representation  of  the  form: 
z(t)  =  r(t)  cos  (to0t  +0(t))  +  n(t). 

Here  r(t)  is  the  envelope,  a  fixed  (possibly  unknown)  carrier  frequency,  6(t) 
a  phase  disturbance  or  uncertainty  and  n(t)  the  (wideband)  noise.  Assuming  wo  is 
sufficiently  known,  after  heterodyning  and  low  pass  filtering  we  typically  have 
the  "quadrature"  observations: 

yx(t)  =  r(t)  cos  0 ( t )  +  s  w2(t) 

(3.1) 

y2(t)  =  r(t)  sin  0(t)  +  s  w2(t). 

We  are  assuming,  of  course,  that  r,  0  are  sufficently  slowly  time  varying  and 
that  "s"  represents  the  noise  level  after  bandshif ting. 

In  many  signal  processing  situations,  passive  sonar  in  particular,  this 
representation  arises  in  an  obvious  way.  Furthermore,  in  many  ocean  environments, 
wx(t)  and  w2(t)  are  modeled  as  statistically  independent  Gaussian  noise  processes 
to  a  very  good  approximation.  This  is  the  basic  model  we  consider  here. 

For  convenience  we  will  henceforth  deal  with  the  "differential"  form  of  the 
observations : 


(3.2) 


dyt  =  d 

1 

Yj(t) 

y2(t) 

= 

r ^  cos 

r  sin  0t 

dt  +  s 

dwlt 

_dw2t_ 

where  wlt,  w2£  are 

statistically 

independent 

Brownian 

motions 

propose  as  a  state  dynamical  model  the  following: 
drt  =  arc  dt  +  0  rc  dvlt 


r\ 

V 

t 

to 

t 

= 

r 

_ 

0  L  0 
0  0  1 
0  0  0 


1 


u) 


dt  + 


0 

0 

q2 


dv2t  (3.3) 


-45- 


-.1  is  J  J  .■  1L 


Che  original  motivation  for  the  transformation  theory  of  the  previous  chapter. 

In  particular,  recent  work  [1,2]  suggests  that  in  a  problem  of  signal  phase  and 
frequency  tracking,  EKF's  formulated  in  terms  of  a  new  coordinate  system  for  the 
state  which  causes  the  observation  to  become  a  linear  functional  on  the  state 
variables  outperforms  the  usual  EKF  at  low  Signal-to-Noise  ratio  (SNR).  We 
investigate  the  present  conjecture  in  relation  to  these  results. 

In  section  2  we  define  and  justify  dynamics  and  observation  equations  for 
our  study.  These  are  useful  models  for  stochastic  narrowband  signals  in  additive 
noise.  Throughout  we  have  in  the  back  of  our  mind  the  goal  of  detection  of  these 
narrowband  sinusoid-like  signals  using  the  usual  quasi-coherent  approach.  Section 

3  discusses  the  discrete-time  filtering  algorithms  we  evaluate.  The  first 
algorithm  is  drawn  from  the  work  of  Johnson,  et.  al.  [1,2];  the  so-called  I-Q 
locked  loop  tracker.  The  second  algorithm  is  based  on  the  linear  equivalence 
transformation  of  the  previous  chapter  and  the  associated  conjecture.  In  section 

4  we  present  the  results  of  a  Monte  Carlo  analysis.  The  data  presented  suggests 
that,  depending  upon  the  SNR  regime,  the  conjecture  is  indeed  true,  although  not 
uniformly  so.  Finally,  an  Appendix  outlines  the  transformation  theory  applied 
to  the  I-Q  Locked  Loop  state  equations  studied  in  sections  2  and  3. 
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CHAPTER  3 


A  Linear  Equivalent  Tracking  Algorithm  for  the  I-Q  Locked  Loot 


1 .  Introduction 

In  tracking  and  recursive  filtering  problems,  linear  dynamics  and  linear  (in 
the  state)  observations  is  the  most  desirable  situation:  the  optimal  solution  is 
easily  obtained  by  realizing  a  finite  dimensional  filter  (Kalman-Bucy  filter). 

The  optimal  solution  behavior  is  well-understood,  much  analyzed,  and  computa¬ 
tional  procedures  have  been  thoroughly  documented.  In  the  previous  chapter 
it  was  shown  how  certain  nonlinear  dynamical  models  (of  the  state)  can  be  trans¬ 
formed  (pointwise)  into  wholly  equivalent,  completely  controllable  linear  models. 
When  the  transformed  observation  system  also  happened  to  result  in  a  set  of 
equations  linear  in  the  new  state  variables,  the  optimal  solution  for  this  linear 
system  (of  Kalman-Bucy  type)  could  then  be  transformed  back  to  the  desired  state 
estimates.  This  leads  to  a  realization  of  a  recursive  state  estimation  solution 
for  the  original  nonlinear  stochastic  differential  equation  (s.d.e.).  Unfortun¬ 
ately,  this  rather  special  happenstance  is  rare.  Thus  the  practical  significance 
of  the  transformation  theory  comes  into  serious  question.  On  the  other  hand,  it 
appears  to  be  relatively  easy  to  find  examples  of  practical  signficance  where  the 
signal  dynamics  model  indeed  satisfy  the  conditions  of  the  linearizing  trans¬ 
formation.  Hence  the  transformation  theory  may  have  some  practical  applications. 

In  view  of  these  observations,  it  is  of  some  interest  to  ascertain  if,  in 
the  case  of  these  "closer  to  linear"  problems,  somewhat  better  behaved  approximate 
recursive  filtering  algorithms  can  be  formulated.  This  Chapter  investigates  this 
conjecture:  In  the  case  where  system  dynamics  admit  an  equivalent  linear  repre¬ 
sentation,  irrespective  of  the  nonlinearity  in  the  corresponding  observation,  do 
Extended  Kalman  filters  (EKF)  based  on  the  linear  formulation  evidence  any 
improved  performance  over  directly  applied  Extended  Kalman  Theory?  Indeed  this  is 
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Combining  this  with  (2.38)  gives 


<j>'  (x){[Q,a]  (x)  +  (f*  °  <Kx))o(x)}  =  0 


[Q,o]  (x)  +  (f*  °  <p)  (x)a(x)  =  0 


Therefore,  using  (2.38)  and  substituting 


(o(x)-)  +  ~^~0(xjX)  =  b<Kx)  +  c  -*2(x> 

/Q(x) v  [Q,q] (x) 

Letting  V(x)  =  'o(x)J  0(x) 


V  (x)  -  c 


,  a=l,  b^O 


4>(x)  = 


-b±(b2-  4 (a2-  l)(c  -  V(x))1/2 
2(a2-  1) 


.  (a^l) 


rxj£_ 

J  0(0 


a=l ,  b=0 


Summarizing, 


(2.39) 


Equations  (2.36)  can  be  transformed  to  a  Benes  equation  with  drift  f(«)  iff 

(i)  l°(x)  I  >  0  Vx  eSx 

(ii)  for  some  at  least  twice  continuously  differentiable  function  4>(x) 
and  scalars  a,b  &  c, 

+  ■  <a2-  +  »♦<*)  +  <= 


In  this  case  4>(x)  is  given  by  (2.39)  and 


f  °  <Kx)  =  (§00} 
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APPENDIX  2.1 


Transformation  of  a  Nonlinear  s.d.e.  to  a  Benes  s.d.e. 

Discussion  is  restricted  to  the  scalar  case.  Consider  the  diffusion 
process  satisfying  the  s.d.e. 

dxt  =  m(xt)dt  +  a(xt)dwt  (2.36) 

where  (wt>  t >  0}  is  a  standard  scalar  Brownian  Motion.  Assume  m(x) ,  m'(x),  a(x)  , 
a' (x)  and  o"(x)  are  continuous  functions  on  Rand  |a(x)|  >0  V  xeR. 

The  purpose  here  is  to  find  a  state  variable  transformation,  4> ,  such  that 
ut  =  <(>(xt)  satisfies  the  Benes  equation 

du^  =  f(ut)dt  +  dwt  (2.37) 

where  f(u)  satisfies  a  Riccati  differential  equation 

f ' (u)  +  fz(u)  =  (au)2  +  bu  +  c  -  u2  (2.38) 

a,b,c  real  constants. 

Following  the  procedure  used  in  Section  2  and  equating  like  terms  gives 
the  equations 

4)'  (x)Q(x)  =  f  o  4>(x) 

4>’(x)g(x)  -  1 

Note  the  following  identities 

(<$)'  (x)Q(x))  ’  =  f '  (u)4>'  (x)  |u=(j)(x) 

(4)'  (x)o(x)) '  =  0 

(4> *  (x)a(x) )  'Q(x)  -  (4)’  (x)Q(x))  'o(x) 

=  4>'(x)  (Q,a](x)  =  -  f '  (u)4>'  (x)  |u=<})(x)0(x) 

=  -  f '  °  4>(x) 
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Theorem  5:  Let  (Q,g)  represent  the  set  of  vector  fields  generated  by  the 
iterative  bracketing  operation  in  Section  3  on  the  vector  fields  associated 
with  an  n-dimensional  s.d.e.,  as  in  (2.16).  Assume  Ln_i(Q,g)  does  not  satisfy 
the  conditions  of  Theorem  4.  Then  there  does  not  exist  a  mapping  <J> : Rn->-  Rn+In 
(m  >  0)  which  linearizes  the  s.d.e.. 

Proof : 

Since  Ln_i(Q,g)  is  singular,  assume  that  for  some  m  >0,  L  (Q,g) 

n+m—  1 

has  full  rank  and  let  <j>  be  an  at  least  twice  continuously  differentiable  map 
(x)  e  Kn+  Rn+m  .  Assume  that  zt  =  satisfies  a  controllable  linear 

s.d.e.  in  n+m  dimensions  with  controllable  realization  (A,b)  where 

(<**•  Wl<Q,8)}  '  C(-A’b)  • 

In  matrix  form 

(3  <p)L  .  .  (Q,g)  -  C(-A,b)  . 

x  n+m- 1 

Since  C(-A,b)  is  a  controllability  matrix  of  an  n+m  dimensional  state  space 
realization,  rank  C(-A,b)  =  n+m.  Also,  since  L  is  full  rank 

Ox<J>)  -  C(-A,b)LT(ILT)_l 
Hence,  (9X4>)  has  rank  at  most  n  and  therefore 

rank  C(-A,b)  *  rank  ( 3x<+>)Z- _<  n 

=*>  the  pair  (-A,b)  is  not  controllable  and  thus  is  a  contradiction  to  the 


hypothesis . 


5.  RELATED  MISCELLANEOUS  TOPICS 

The  nonlinear  stochastic  differential  equations  which  satisfy  the 
foregoing  transformation  theory  are  a  rather  special  subset  of  the  problems 
one  might  encounter  in  practice.  With  a  view  to  broadening  the  collection  of 
s.d.e.'s  which  might  be  studied  in  the  approximate  filtering  framework  studied 
in  the  next  chapter,  this  section  considers  two  further  questions: 

(1)  What  are  the  comparable  conditions  to  those  of  the  previous  sections 
with  which  to  recognize  when  a  given  nonlinear  s.d.e.  is  equivalent 
to  some  other,  perhaps  more  useful,  although  nonlinear  s.d.e.? 

(2)  Under  conditions  similar  to  those  of  the  previous  sections  is  it 
possible  to  find  nonlinear  s.d.e.'s  which  may  linearized  by  a  point- 
wise  transformation  to  a  higher  dimension  state  space? 

As  to  first  question  it  is  clear  that,  given  a  target  s.d.e.,  the  mapping 
from  a  general  nonlinear  s.d.e.  to  this  form  can  be  described  in  terms  of  a 
set  of  ordinary  differential  equations.  Conditions  for  which  this  set  of 
equations  may  be  solved,  if  they  are  known,  will  describe  the  desired  mapping. 
An  example  using  the  scalar  Benes  equation  [2]  is  given  in  Appendix  2.1.  The 
utility  of  this  example  is  unknown. 

With  respect  to  the  latter  question,  it  is  obvious  that  one  can  take  an 
n-dimenslonal  linear  s.d.e.  and  map  the  corresponding  state  variables  into 
another  set  of,  say,  n-m  variables  which  satisfy  a  nonlinear  s.d.e.  Such  a 
mapping  is  not  1-1  nor  will  it  be  invertible.  In  fact,  the  simple  argument 
which  follows  shows  that  an  n-dimensional  nonlinear  s.d.e.  cannot  be 
"controllably  linearized”  by  any  transformation  to  higher  dimension  if  it 
does  not  already  satisfy  the  conditions  of  Theorem  4. 
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B(xJ  + 


x2> 


6(x[ 


x2> 


0 


Vx) 


0  0 

~X2  X1 

8(x*  +  x*)  B(x*  +  x*) 


-1 

0 


Since 


3  x^  3  x2 

3x2  ^(x^  +  x2)}“  3xj  ^ (x j  +  x*)^ 


0 

0 

0 


it  follows  that  dwL  =  0  and  therefore  integration  yields  =  dQ^  where 

dn(x?  +  x2)  x 

0L(x)  -  col  ( - jg - ,  x^,  -  x3.  Tan  (— )). 

A  version  of  the  desired  transformation  is  then 

£n(x ?  +  x*)  _j  *2 

T(x)  «  col  ( - - >  Tan  (— )  •  xj»  *4) 

with  the  corresponding  equivalent  linear  s.d.e.  representation 


“0 

0 

0 

0  " 

~  1 

o' 

0 

0 

-1 

0 

Z  dt  + 

0 

0 

dwlt 

0 

0 

0 

-1 

t 

0 

0 

dw2t 

0 

0 

0 

0_ 

0 

1 

— 

_ 

_ 

This  example  is  used  in  Chapter  3  to  study  an  approximate  filtering  application 
of  the  transformation  theory  developed  in  this  chapter. 


where  8^0.  Applying  corollary  2  we  have 


Ax^  -  x^x2 

6x1 

1 - 

0 

1 

Xx2  -  x^x^ 

gL(x)  = 

8x2 

g2(x>  = 

0 

X4 

0 

0 

1 - 

0 

1 _ 

0 

1 

1  , 

where  ,  a  -  y  £  • 

[Q,gl](x)  =  0  =  0  V  m>  1) 

[Q,g2l(x)  =  col  (0,0, -1,0) 

Lp(g2)(x)  =  col  (-x2,  xlf  0,  0) 

Lp(g2)(x)  =  0 


Hence,  the  Kronecker  Indices  are  kj.  -  1, 
vector  fields  L2  is, 

8x^  0 

8x2  0 

0  0 

0  1 


k2  *  3  and  a  representation  for  the 


-1  0 

0  0 


L2(x)  = 


L2(x)  Is  full  rank  everywhere  except  at  x^  =  X2  =  0.  If  the  diffusion 
process  represented  by  (2.35)  is  assumed  to  be  generated  from  an  orthogonal 
representation  of  a  "narrowband"  signal,  with  x^t  and  X£t  representing  the 
quadrature  components  (c.f.  Chapter  3,  section  3),  then  it  follows  that 
Xlt  +  x2t  >  0  for  all  t>0.  With  this  L2(x)  is  everywhere  l  rank 
(on  the  state  space)  and  the  coefficients  of  the  first  dual  to  L2(*)  are,  in 


matrix  form. 


■J,  ,  -  -v  v-  »  7  -  . 


X,  * 

forms  dT  with  the  vector  fields  L^J(gj),  j“l,  •••»  m,  JL  =  0, 

1»  *  *  * »  k j  and  use  of  a  Popov  form  for  the  pair  (A,B)  with 

Kronecker  indices  (k.  ,  k  ). 

1  m 

(2)  The  strong  conditions  in  (ii),  k  >  0  and  at  least  one  j  for 
which  k^>l,  can  be  relaxed  provided  that  for  all  j  such  that 
kj  =  0,  g_.  (x)  may  be  represented  by 

8^(x)  =  )>  a„gn(x),  a„  scalars. 

3  U|  V0}  a 

An  Example 

The  following  example  is  motivated  by  a  realistic  signal  tracking  problem 
studied  in  the  next  chapter.  The  nonlinear  state  equations  are  as  follows: 

4 

Let  xt  =  col  (xu,  x2t,  x3t,  x^)  e  K  , 


dx 

t 


ax 


It 


ax 


2t 


X2t" 

Pxlt 

0" 

Xlt 

dt  + 

Sx2t 

0 

d»lt 

0 

0 

0 

1 

dw2t 

_ 

_ 

_ 

(2.35) 
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With  these  preliminaries  the  multi-input  result  is  stated  in  the  following 
Corollary  to  Theorem  A: 

Corollary  2;  Let  (xt>  t > 0 ,  x £  Rn  }  be  the  non-degenerate  diffusion 
process  satisfying  the  non-linear  s.d.e.  (2.33).  Define 
Q(x)  *  f(x)  -  y  I  (dgi(x),  gi(x)^  • 

There  exists  a  simply  connected  open  domain,  and  a  smooth 

transformation  T:U-*Rn  such  that  zt  =  T(xt)  has  the  representation  (2.3A) 
if  and  only  if : 

(i)  Q(o)  =0,  0  £U 

(ii)  there  exist  positive  integers,  k^  >0,  j=l,  2,  ....  m, 
such  that, 

m 

l  ki  =  n 
1  J 

with  at  least  one  l£j^m  such  that  k^  >  1  and 

Lm  =  {gl’  ***’  LQ1_1(81)*  V  *•*’  LQm'lK)} 

form  a  linearly  independent  set  on  a  simply  connected  open  domain 
U  c  j^n  containing  the  origin 

(iii)  the  duals  to  L  ,  ojt  =  (L  Sdx,  are  closed  on  U  (i.e.,  dui  =0  V  £  U) 

m  L  m  —  —  L 

The  required  transformation,  T,  in  this  case  is  obtained  by  choosing 
a  Popov  form  (A,B)  with  Kronecker  indices  kj  ,  j=l,  2,  . . . ,  ra, 
computing  the  controllability  matrix  for  (-A,B)  and  performing  the 
integration 

T(x)  =  J  C  (-A ,  B)u>  . 

(O.xKu 

Remarks : 

(1)  The  proof  of  Corollary  2  follows  directly  from  Theorem  A 

after  identification  of  the  required  inner  products  of  the  1- 


Under  the  IQLL  framework  the  coordinate  representation  of  interest  is  that 
which  causes  the  observation  to  depend  linearly  upon  the  "signal".  Therefore, 


it  =  rt  cos  6 
qt  =  rt  sin  et 


whence , 


dyt  = 


dt  +  s 


dw 


It 


dw 


2t 


(3.4) 


The  state  dynamics  equations  therefore  become  nonlinear  in  the  new  state  variables: 


—  — 1 

£t 

ait  -  Vt 

61t 

0 

qt 

= 

aqt  +  Vt 

dt  + 

6qt 

0 

wt 

Wt 

0 

0 

Jt_ 

0 

0 

_ 

q2 

dv 


It 


dv 


2t 


(3.5) 


Note  that  even  when  a  =  0,  the  case  studied  in  [1],  (3.5)  would  remain  in  coupled 
bilinear  form. 

From  the  appendix,  the  equivalent  form  for  the  "log  normal"  diffusion  model 
given  by  (3.5)  is 


—  — 

z 

t 

0 

t 

a) 

t 

Jt_ 

0  0  0  0 

zt 

B  o 

0  0  10 

e. 

0  0 

L. 

dt  + 

0  0  0  1 

wt 

0  0 

0  0  0  0 

_^t_ 

0  q2 

dv 


It 


dv 


2t 


(3.6) 
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which  is  completely  linear  in  the  (new)  state  coordinates  (z,  8,  u,  ()■  Of 
course  here  the  observation  remains  nonlinear  in  the  states  and  is  seen  to  be 


dyt 


(zt  +  Xt) 

r  e  cos  0 

o  t 

(z  +  A  )  dt  +  s 

r  e  C  sin  8_ 

o  t 


(3.7) 


These  two  alternative  representations  provide  the  starting  point  for  our 
comparison  of  approximate  filtering  algorithms  in  response  to  the  conjecture 
posed  in  section  1.  The  next  section  develops  appropriate  discrete-time 
approximations  and  discusses  the  Monte  Carlo  analysis  set-up. 


^  v" 


3.  Development  of  the  Filtering  Algorithms 

We  restate  the  partial  observation/filtering  problem: 
State  Dynamics: 


dv 


—  — 

—  — 

—  — 

—  — 

r 

a  0  0  0 

rt 

Brt  0 

e 

0  0  10 

0t 

0  0 

= 

dt  + 

OJ 

0  0  0  1 

0  0 

K 

0  0  0  0 

1 

sTn 

rt 

1 _ 

- 1 

c r 

o 

_ 1 

It 

dv2t 


(3.8) 


Observation: 

(3.9) 

where  (rg,  0g,  u>g,  £g)  are  statistically  independent  of  w^t>  w2t*  Vlt*  V2t* 
t  _>  0  and  the  latter  are  statistically  independent  Brownian  motions  (stan¬ 
dardized).  We  assume  that  rt,  0t ,  u>t  and  £t  are  statistically  independent 
of  the  observation  noises,  t  ^  0. 

Under  the  two  alternative  approaches,  discussed  in  section  2.,  one  can  now 
formulate  two  extended  Kalman  filters.  From  the  linear  equivalent  system  the 
nonlinearity  of  the  problem  is  isolated  in  the  observation  (c.f.,  egns.  3.6  &  3.7) 


*2t 


rt  cos  0t 
rt  sin  0t 


dt  +  s 


dw.it 

dw2t 


whereas  in  the  1QLL  coordinates  1  the  nonlinearity  has  been  isolated  in  the 
state  dynamics  equations  (c.f.,  egns  (3. A)  &  (3.5)).  The  corresponding  extended 
Kalman  filters  for  these  two  representations  are  as  follows: 

(a)  Linear  Equivalent  Form 


— 1 

Si  > 

_l 

1 

o 

o 

o 

o 

1 _ 

Z 

,1  z  +  Xt  a  A. 

t 

t 

ay  -re  costt  at 

A 

o 

0 

0  0  10 

0 

t 

= 

t 

dt  +  K* 

A 

t 

,2  z  +  At  .  £  . 

CO 

0  0  0  1 

U) 

dy  -re  sin0  dt 

t 

t 

o 

0  0  0  0 

f.t  _ 

_£t_ 

-  — 

or  more  compactly. 


i.-  .i  . 
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and  R  1  =  — 2  I  (I  is  the  2x2  identify  matrix) 


It  is  easy  to  see  that  this  equation  decouples  into  a  lxl  equation  for  the 
"envelope"  variance  and  a  3x3  equation  for  the  "phase  error"  covariances.  TV 


P*  =  Pr  =  P’ 
z  r  i 


are 

given  by  P* 

=  r 
— 

6s  | 

r 

2a 

2a2 

a3 

2a2 

3a3 

2a4 

a3 

2a4 

2a5 

where  a  H. 


„  2  C.  2 

r 


l/6 


Finally, 


P*1 


2a 


K.*  =  P*HT  (Xle)  R_1 


2a2 


s2S2 


a" 


s2S2 


r  (cos  8  t  sin  0  ) 
r  (-sin  0 ,  cos  0  ) 
r  (-sin  0 ,  cos  0 ) 

A  A 

r  (-sin  ® ,  cos  6  ) 


and 


KIQ  ~  ^  K* 


IQ'  LEJ 


where  J  ( X )  represents  the  Jacobian  matrix  of  the  transformation  from 
coordinates  to  the  IQ  coordinates  and  is  given  by 

/“v  A 

I  -Q  0  0 

Q  I  0  0 

0  0  10 

0  0  0  1 

A 

zt  +  At 


J  <xiq/xle>  = 


for  I 


+  At 


COS  0  -  0^  Sr  e 


sin  0  .  Fron)  this 


x  > 


r2  +  Yl 


*IQ  = 


<  O’ 

<  M 

2a 

s2S2 

2a2 

A 

Q 

OJ 

a 

CN 

s2S2 

a3 

/*« 

n 

a3 

s2S2 

s2S2 

r2  +  YQJ 


Above,  the  expressions  used  to  simplify  notation  are 
S2  -  r  /s2  and  Y  -  72  {  F^r  fr  }  . 


In  the  IQLL  filter  in  [1],  the  apparent  "small”  signal  approximation  is  introduced 
to  decouple  the  IQLL  estimation  equations  as  follows: 


p*  s  2a/s2 
1 1 


for  a  <  <  1,  (r2  <  <  —  )  . 


Using  this,  and  the  gain  matrices  above,  the  two  algorithms  are, 
Linear  Equivalent: 

I  8  I 

d£t  “  _  {  cos  9t  dy  it  +  sin  6  dy2t  -  rt  dt  } 


6C  010  9 1 


d  u) t  “  0  0  1  dt  +  2a 

A 

0  0  0  lt  a 

2t  +  At 


{-sine^  dy  !  t  +  cos0t  dy2t) 

-  '  '  - /V 

rt 


(3.10) 
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4.  Filter  Comparisons 

The  two  filtering  algorithms  were  compared  using  Monte  Carlo  simulations. 

Each  filter  was  given  identical  input  data  generated  according  to  the  discrete 

time  approximation  to  (3.8)  and  (3.9).  Simulations  were  performed  for  of  signal- 

to-noise  ratios  ranging  from  -13B  to  +20dB.  For  the  purpose  of  comparison, 

signal  to  noise  ratio  was  computed  as  the  integrated  average  signal  power  over 

the  sample  contained  in  the  observation  (3.9)  divided  by  the  noise  variance 

,  tx+T 

EH;f  r2  (t)dt } 

CWR  =  T  1 1 


In  Eqs  (3.8)  and  (3.9),  signal  dynamics  and  observations  were  approximated  using  a 
4th  order  stochastic  Runge-Kutta  scheme  [6]  at  a  sample  spacing  of  0.05  seconds. 


Because  the  bilinear  Ito  equation, 
dr  =  ardt  +  grdvlt 

can  be  solved  explicitly  (rt  *  ro  e  Vit  +  ^ *  >>  =  at-  ^  6  ) 

this  solution  was  used  to  generate  the  sample  paths  for  rt  (rn  =  rtn,  n  =  1,2,...). 
The  phase  variables  employed  the  stochastic  integration  routine. 

Monte  Carlo  runs  consisted  of  generating  sample  paths  of  length  240  seconds 
each  using  a  white  Gaussian  random  number  generator  whose  correlation  time 
corresponded  to  2^1  samples  (i.e.,  the  pseudo-random  numbers  are  guaranteed  not 
to  repeat  identically  over  any  contiguous  set  of  2^1  values).  A  corresponding 
ensemble  of  sample  paths  was  produced  for  each  signal  in  noise  (SNR,  etc.) 
condition.  Each  such  ensemble  was  then  processed  using  a  discrete-time  approxi¬ 
mation  to  the  two  filtering  algorithms  (a  predictor-corrector  form  in  accordance 
with  [1]).  The  output  from  the  filtering  routines  were  estimates  of  the  original 

A  A  A  A 

states  (rn,  9n,  wn,  £n);  n  =  1,2,...  .  The  filter  implementations  are  given  as 
follows: 
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Linear  Equivalent  Form: 


Zn  =  Zn_1 
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+  A 

•  4.  12 

w  n-i 
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S 

-  [cos  0n  6  y 
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0 n  =  en  +  2an  [-sin0n  6  +  cos  0n  6  y^  }/rn 
wn  =  wn  +  2an  L-sin  9n  6  yn  +  cos  6n  6  yn  J  /rn 

A 

*“**  Z 

l n  =  l n  +  (ii)  t -sin  en  5  yn  +  cos  Qn  <5  Yn  1  &  =  rn  e  n 


correction 


where 

6  Yn  =  yn  "  Yn-i*  <5  Yn  =  yn  ~  yft-i 
and  an  =  (<l2rn')l/3 


IQLL  Form 

[V 


cos  A0  -sin  A0 
sin" AS  cos  AS 


0  £n-i 


+  2cxn 


On  =  (<\l  rn)1/,&  ,  r„ 

~~P~ 

l  -  (j;  ♦  o 1/2 


prediction 


6yn  -  InA 
^yn  “  QnA 


^-Qn6  yn  +  ^n6  yn 


-2  ~2 

In  +  ^n 


correction 


l  XM.  A.  -.A.- 


-  '■  -  a2  ~ 

and  A8  =6  -  0  =  A  •  w  ,+  -77-  £n-l  • 

n  n  n-1  n-1  2 

For  the  IQLL  filter,  to  compare  phase  estimates  between  the  two  algorithms, 

6n  was  taken  as 

8n  =  0n  +  A0n  +  2an  [ -Qn  <Syn  +  In<Sy2]/  r2  , 
since  the  IQLL  does  not  produce  these  directly. 

Mean  Square  Error  Comparison 

The  Monte  Carlo  comparison  of  the  two  filtering  algorithms  are  summarized 
below.  The  comparisons  are  based  on  simulations  for  which  the  envelope  power, 
E(rt2},  was  time  independent;  for  this  situation  choose  ct,B  such  that  A  =  -  B2 . 

This  allowed  an  essentially  stationary  representation  of  signal-to-noise  ratio 
thoughout  the  numerical  comparisons. 

Figures  3-1  thru  3-4  show  typical  results  for  the  two  algorithms  for  estima¬ 
tion  error  (mean  square)  as  a  function  of  input  SNR.  Only  the  results  for 
envelope  and  phase  are  shown  as  both  algorithms  perform  equivalently  in  terms  of 
mean  square  error  for  frequency  (oj)  and  frequency  rate  (£)•  The  IQLL  algorithm 
did  show  up  to  0.5  dB  improvement  in  frequency  estimation  error  in  a  few  cases, 
but  not  consistently. 

Figures  3-1  and  3-2  show  the  typical  situation  for  the  phase  and  envelope 
estimates  respectively.  These  figures  also  show  the  amplitude  known  (Figure  3-1) 
and  phase  known*  (Figure  3-2)  performance,  respectively,  for  "lower  bound"  com¬ 
parisons.  These  are  the  best  possible  estimates  to  be  achieved  from  either 
algorithm  in  these  situations.  In  both  instances,  the  linear  equivalent  approxi¬ 
mation  appears,  generally,  to  outperform  the  IQLL  filter  and  performance  advantage 


Note:  In  Figure  3-2,  the  phase  known  case  represents  the  optimal  envelope 
estimation  performance;  the  linear  equivalent  approach  reduces  to  the  optimal 
estimate  for  rt  in  this  situation  (within  experimental  and  discretization 
error),  whereas  the  IQLL  formulation  does  not. 
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is  clearly  a  function  of  input  SNR.  The  best  advantage  for  phase,  within  the 
region  of  acceptable  error,  (<  1  rad),  appears  to  be  less  than  about  5-7dB  and 
consistently  occurs  just  above  the  transition  SNR.  The  error  advantage  of  the 
linear  equivalent  algorithm  for  envelope  estimates  can  be  quite  large  being  a 
function  of  SNR  and  envelope  state  noise  power.  Figure  3-2  shows  up  to  about  3dB 
improvement  in  estimation  error  which  appears  to  be  the  typical  improvement  over 
the  range  of  reasonable  dynamics  noise  levels. 

Figure  3-5  presents  an  aplications  oriented  comparison  of  the  phase 
estimation  performance  of  the  two  algorithms.  This  figure  plots  the  input  SNR  at 
which  the  phase  errors  are  below  an  acceptance  threshold  of  30°  RMS.  Although  30° 
is  arbitrary,  it  is  a  reasonable  criterion;  errors  such  in  excess  of  this  value 
make  the  phase  estimates  essentially  useless  for  any  tracking/detection  scenario. 
The  independent  variable  in  the  figure  is  taken  to  be  the  phase  dynamics  noise 
level,  q2  in  egn.  (3.8);  | q2 |  “  0  represents  the  phase  known  case  and  increasing 
| q2 |  represents,  in  a  loose  sense,  the  degree  of  inherent  randomness  in  the 
phase  process.  With  this  interpretation,  figure  3.5  indicates  about  a  ldB 
"threshold"  advantage  for  the  linear  equivalent  algorithm  over  the  range  of  | q2 | 
studied . 

Finally,  we  note  that  at  and  below  transition,  results  tend  to  be  mixed, 
perhaps  reflecting  large  statistical  uncertainty  associated  with  low  SNR  and 
large  errors.  Additionally,  as  SNR  decreases,  the  linear  equivalent  algorithm 
diverges  more  rapidly  than  the  IQLL.  This  reflects,  perhaps,  dominating  effects 
of  the  observation  nonlinearity  of  the  linear  equivalent  algorithm  in  this  regime 
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•j 

•i 

a 

1 


'•I 

u 


of  error  (and  SNR). 


Sensitivity  to  Initial  Conditions 

Experiments  performed  to  discover  the  sensitivity  of  the  two  algorithms  to  the 
assumed  initial  state  are  summarized  in  Table  3-1.  This  data,  in  Table  3-1  (a)  - 
(d),  indicates  the  approximate  time  for  convergence  of  the  filter  state  estimates 
under  the  conditions  of  randomized  initial  state  estimates.  The  controlling 
parameter  in  these  data  is  the  initial  state  estimate  variance  for  each 
component,  (r0,  0O,  0Jo,  £0), relative  to  the  actual  value.  When  convergence 
was  not  achieved  after  4  minutes  the  filter  was  assumed  not  to  converge.  Note 
that  although  one  state  estimate  does  not  converge  the  others  may. 

The  IQLL  algorithm  was  much  less  sensitive  to  initial  amplitude  than  the 
Linear  Equivalent;  whereas  exactly  the  opposite  was  the  case  for  initial  phase 
estimates.  Frequency  and  frequency  rate  sensitivities  appear  to  be  approximately 
equivalent  for  the  two  filters.  The  sensitivity  of  the  IQLL  to  initial  phase 
estimation  accuracy  would  seem  to  be  the  consequence  of  the  use  of  I,Q  instead  of 
a  phase  coordinate  directly  and  the  fact  that,  for  both  algorithms,  initial 
amplitude  error  appears  to  be  relatively  insignificant  (e.g.,  or^  5  160.0  and 
40.0).  This  is  also  probably  related  to  the  "observability"  of  the  0  coordinate. 
The  robustness  of  the  Linear  Equivalent  algorithm  to  initial  phase  estimate 
appears  to  be  a  consequence  of  the  coordinates  using  the  phase  variable  directly 


in  estimation. 


As  a  final  note,  these  sensitivity  analyses  were  performed  such  that  in¬ 
dividual  state  variable  effects  were  "isolated".  That  is  when  one  state  initial 
condition  was  varied,  the  other  state  variable's  initial  conditions  were  fixed  at 
the  a  priori  known  values.  Composite  effects  are  less  clear  but  seem  to  be 
characterized  by  an  ellisoidal  region  defined  by  the  "isolated"  variable  sensi¬ 
tivity  boundaries,  (c.f.  Table  3-2). 
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Figure  3-1  MS  Phase  Estimation  Error  Comparison;  Linear  Equivalent  versus  1QLL 
Approximate  Filters  with  amplitude  known  lower  bounds  (X  •  -O.Ul, 


Filters  with  phase  known  lower  bounds  (Xm  -.01, 


(Advancing  target  simulation) 


RMS  ERROR  THRESHOLD  SICNAL-TO-NOISE  LEVEL 


INPUT  RMS  PHASE  "NOISE"  LEVEL  (JB) 


Figure  3-5 


30*  RMS  Phase  Error  Cut-off  Threshold  Comparison;  Linear  Equivalent 
versus  IQLL  (X-  -0.01,  8-  0.1):  Input  SNR  versus  qj. 
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TABLE  3-1 

(a)  Estimator  Sensitivity  to  Initial  Amplitude  Variable  Uncertainty 
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(c)  Estimator  Sensitivity  to  Initial  Frequency  Variable  Uncertainty 
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(d)  Estimator  Sensitivity  to  Initial  Frequency  Rate  Variable  Uncertainty 
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TABLE  3-2 


5. 


Conclusion 


An  alternative  approach  to  "amplitude"  and  phase  tracking  from  quadrature 
observations  has  been  presented  and  experimentally  investigated.  The  motiva¬ 
tion  for  the  new  filtering  algorithm  comes  from  the  notion  of  "closer  to  linear" 
evolution  equations  based  on  linear  equivalent  dynamics  using  a  Transformation 
theory  for  nonlinear  diffusion  models.  Discrete-time  approximate  filtering 
equations  were  formulated  for  a  novel  envelope/phase  narrowband  signal  model 
which  should  prove  useful  in  real  tracking  applications  in  passive  sonar. 

Through  Monte  Carlo  analysis  it  was  shown  that  the  linear  equivalent  formulation 
outperforms  the  IQ  locked  loop  algorithm  for  estimating  "amplitude"  and  phase 
in  additive  noise:  as  much  as  3dB  (envelope)  and  4-6dB  (phase)  improvement 
were  demonstrated.  This  comparison,  however,  required  the  augmentation  of  the 
standard  IQLL  algorithm  with  a  fifth  state  variable  (i.e.,  phase).  In  fairness 
to  the  IQLL  approach,  which  is  specifically  formulated  to  avoid  this  direct 
estimation  of  phase,  estimates  of  the  in-phase  and  quadrature  coordinate 
variables  were  also  compared.  It  was  found  that  the  linear  equivalent  algorithm 
provided  as  good  or  better  (up  to  1.5dB  better)  estimates  even  for  these 
variables  as  did  the  IQLL  algorithm.  Finally,  the  IQLL  appears  to  be  (overall) 
somewhat  less  sensitive,  in  its  natural  coordinates,  than  the  linear  equivalent 
to  accuracy  of  assumed  initial  estimates. 

Whether  the  application  in  mind  is  for  quasi-coherent  sequential  detection 
or  phase/frequency  tracking,  the  linear  equivalent  approach  appears  to  offer 
distinct  advantages  in  terms  of  mean  square  error  performance.  This  confirms 
our  original  conjecture  about  the  dominant  behavior  of  nonlinear  dynamics  in 
the  higher  SNR  regime.  On  the  other  hand,  when  dynamics  modelling  errors  may 
be  significant  the  IQLL  is  the  preferred  approach  as  the  linear  equivalent  is 
rather  sensitive  to  the  underlying  model  parameters  through  the  Kalman  gain. 


APPENDIX  3.1 


The  Linear  Representation  for  the  IQ  Locked  Loop  Dynamics 


The  application  of  Corollary  2,  Chapter  2  to  Eqn.  (3.o)  is  a  matter  of  computation 
as  shown  below 


Letting  It  -  rf  cos  9^  =  rfc  sin  6^  equation  (3.5)  is  reiterated  as 


_It' 

aIt  - 

i 

•u 

O' 

3 

~6It 

0 

«t 

ccQt  + 

Wlt 

dt  + 

BQt 

0 

“t 

vP'l 

rr 
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0 
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^2 

_  _ 

_ 

_ 

L 

_ 

dv 


It 


dv 


2t 


(A-l) 


Note  that  this  is  just  the  example  of  Chapter  2,  Section  4.  Following  that  example 


U  = 


Xl  -  wQ 
XQ  +  to  I 

c 

0 


gL  =  col  (I,  Q,  0,  0) 
g9  =  col  (o,  o,  o,  i) 


tu.g.l  =  o  L  (g.)  =  0 


V  m  >  1 


[U,g2]  =  col  (0,  0,  -  q2>  0) 

L2(g2)  =  q2  co1  (  -  Q»  i»  °»  °) 


LX>  =  0 


The  Kronecker  indices  are  kj  =  1,  k2  =  3  and 


L2(x)  = 


det  (L2(x)) 
1-forms  are 


=  -  Bq~  (I2+  Q2)  =  -  gq9  r2^  0  for  any  1 3r  |  >0  and  q„  ^  0.  The  dual 


WL(X)  = 


_L_  _2_  o 

Br  2  Br2 


0  0  1/q, 

0  -l/q2  0 


q9r2  q.r2 


and  dcj  =  0  V  I ,  q  =*  oj  (x)  =  d0  (x) 

L  L  L 

i.e.,  may  be  integrated  up  exactly,  0j  (x)  =  /  0)^(0.  and  one  finds 


Y  In  (I2+  Q2) 
Tan_1 (Q/I) 


ln(r) 


Thus  the  equivalent  linear  system  is 

X 

Wt 

*t 
0 

Letting  Z  =  Z'  -  Xt  gives  equation  (3.6), 


(A-2) 


In  this  case  the  observation  equation  is  now  of  the  form: 


( A-3) 


which  is  equation  (7).  Of  course,  the  same  result  is  obtained  by  recognizing  that 
(r^,  t  > 0}  is  a  "log  normal"  process  related  to  a  scaled ,  time  shifted  Brownian 
motion.  However,  the  equivalence  transformation  theory  provides  a  systematic  method 
in  the  general  case.  In  this  sense,  it  is  a  tool  of  some  value. 


(*)  Note  here  that  this  transformation  results  only  in  a  local  equivalence, 

{0  1 0 1  <<  £•} .  This  is  not  a  difficulty,  coming  only  from  the  choice  of  the 
(I,Q)  coordinates  to  start  with.  By  beginning  the  application  of  Corollary  2 
with  the  system  in  (3.3)  one  finds  a  globally  equivalent  linear  diffusion  of 
the  same  form  as  in  (A-2) . 
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